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ABSTRACT 

Since 1985 several procedures for computerized test 
construction using linear programing techniques have been described 
in the literature. To apply these procedures successfully, suitable 
item banks are needed. The problem of designing item ^anks based on 
item response theory (IRT) is addressed. A procedure is presented 
that determines whether an existing item bank meets the test 
construction requirements, if not, the method indicates which items 
have to be added to the banks so that it will meet the requirements. 
The comparison of desired and present item bank characteristics, 
writing, and calibrating items continues until the characteristics of 
the item bank are acceptable. Four categories of characteristics are; 
(1) general characteristics (such as format); (2) subject matter 
characteristics (such as learning objective); (3) psychometric 
characteristics (such as IRT-parameters) ; and (4) user statistics. 
One figure illustrates the procedure. A 19-item list of references is 
included . ( Author/SLD) 
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Abstract 

Since 1985 several procedures for computerized test 
construction using linear programming techniques have been 
described in the literature. To apply these procedures 
successfully in practice, suitable item banks are needed. 

In this paper the problem, of designing IRT-based item 
banks is addressed. A procedure is presented that, for an 
existing item bank/ determines whether it meets the tests 
construction requirements. If not, the method indicates which 
items have to be added to the bank, so that it will meet the 
requirements . 

Key words: Item response theory, Rasch model linear 
programming, item banking, test construction, 
item bank design, cluster-based test 
construction, information functions • 
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A Method for Designing IRT-based Item Banks 



An item bank is a large set of items stored with their item 
characteristics. The use of item banks for test construction 
is becoming more and more popular. Especially the application 
of (integer) linear programming methods to test construction 
problems turns out to be promising (Adema, 1990a, 1990b, 
1990c; Adema, Boekkooi-Timminga, ^ van der Linden, in press; 
Adema & van der Linden, 1989; Baker, Cohen & Barmish, 1988; 
Boekkooi-Timminga, 1987, 1989, 1990a, 1990b; de Gruijter, in 
press; Theunissen, 1985, 1986; van der Linden & Boekkooi- 
Timminga, 1989) ♦ In contrast to the large amount of attention 
paid to procedures for test construction, the problem of 
designing item banks was rather ignored. Present guide-lines 
for designing item banks regard tho quality of the ir>dividual 
items, and, for the total collection of items, for instance, 
the spread of items over objectives, skills, and tasks. No 
guide-lines are provided for designing suitable IRT-based 
item banks to which the modern test construction methods can 
be applied successfully • The importance of well-designed item 
banks is clear. For instance, an item bank may become 
exhausted after some time, meaning that no longer 
satisfactory tests can be constx'ucted from it, because a 
large number of items is excluded from selection on basis of 
their previous usage. Also, the psychometric qua]ity of tests 
constructed later may decrease. 
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Here the problem of assessing the practical usefulness 
of Item banks is addressed. A method Is proposed that 
compares the desired features of the Item bank with 
present features. To determine the desired features of an 
item bank, it is assumed that tests will be selected from the 
item bank using (Integer) linear programming techniques. 
Furthermore, it is assumed that the Rasch model holds. In the 
discussion of this paper, it is argued why other IRT-models 
are not considered. 

In this paper, first, an outline of the item bank design 
method is given, and the notation used is summarized. Then, 
the Basic Method is described. Next, it is described how the 
Basic Method can be adapted if additional practical 
constraints have to be considered for the tests. The paper 
ends with a discussion. 

Outline of the Item Bank Design Method 

It is assumed that tlie item banks to be examined fit the 
Rasch model. The question is, whether an existing item bank 
will be capable to handle a series of test construction 
requests satisfactory . 

In short the method works as follows: Flrsn, the 
specifications of all tests to be selected from the item oank 
have to be given. Next, the numbers and characteristics of 
the items needed in the bank, to be able to construct these 
tests, are determined and compared with the characteristics 
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of the present it'am bank. If the item bank does not fit the 
needs, the method determines the characteristics of items 
that have to be added to the item bank, such that it will 
become suitable. After these items have been constructed and 
calibrated, the present and the desired item bank are 
compared again^ This process of comparing the desired and 
present item bank, writing and calibrating items continues 
until the characteristics of the item bank are acceptable. 

Four categories of item characteristics, that can be 
stored in an item bank, are distinguished (van Thiel & 
Zwarts, 1986) ; (1) general characteristics, e.g. item format, 

(2) subject matter characteristics, e.g. learning objective, 

(3) psychometric characteristics, e.g. IRT-parameters, and 

(4) user statistics, e.g. date of last use. Stodola (1974) 
gives a thorough overview of the individual item 
characteristics of interest for item banking. It is not 
necessary to consider constraints on all of these item 
characteristics in the test specifications to be given in the 
first step of thG item bank design method. One reason is that 
a pra::x:itioner often can not formulate such detailed 
requirements, when the actual construction of the tests is 
not under discussion. The second reason is that, because uni- 
dimensional item banks are considered, items differ only with 
respect to a small number of characteristics. The most 
important differences are psychometric. Therefore the main 
emphasis in this paper will be on the psychometric 
characteristics of the items . The information function is 
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used to represent these psychometric characteristics. A 
definition of the information function can be found, for 
instance, in Birnbaum (1968) or Lord (1980) ♦ 

Notation 

The symbol-: used are given in alphabetical order. 
Basic Method 

b^ difficulty parameter of items located within 

ability interval c. 
c 1, C ability intervals. 

g = 1, G groups of tests not allowed to have any items 

in common. 

set of ability intervals with s^^ ^ ^ 
test t. 

I(s(6j^) information of an item located within ability 

interval c at ability level B^. 

j = 1, J item characteristics. 

k = 1, K ability points. 

set of ability intervals with v^c < 0 for 
test t. 

Itj the shortage of items reflecting item 

characteristic j in the item bank for test t. 

i^c^ "^cj help variable used for computing the number 

of items (reflecting item characteristic j) 
to be added to ability interval c of the item 
bank. 
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test length. 

number of items in test reflecting item 
characteristic j. 

final number of items (reflecting item 
characteristic j) to be added to ability 
interval c of the item bank. 

relative height of the test information 
function at ability level k. 

number of items (reflecting item 
characteristic j) needed in interval c for 
Group g of non-overlapping tests, 
number of items needed in ability interval c 
for test t* 

number of items reflecting item 
characteristic j needed for test t. 
tests to be constructed. 

number of items reflecting item 
characteristic j needed in interval c for 
test t. 

set of tests t belonging to Group g. 
number of items (reflecting item 
characteristic j) to be added to ability 
interval c so that the tests in Group g can 
be constructed best (if Vg^, ^gcj ^ • 
number of items to be added to ability 
interval c of the item bank so that test t 
can be constructed best (if v^q < 0) . 

■ 1 
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w^j number of Items reflecting Item 

characteristic j available In the Item bank 

In the ability Intervals In Hf 
Kq, x^j number of Items (reflecting Item 

characteristic j) In ability Interval c 

needed for the test, 
y additional decision variable used to maximize 

the relative test information function, 
z^, z^j number of items (reflecting item 

characteristic j) available in ability 

interval c of the item bank. 

Basic Method 

It is assumed that the ability continuum is partitioned into 
C (c = 1, 2, C) Intervals. Ic(6)^) represents the item 

information function value at ability level of all items, 
having difficulty parameter values within ability Interval c. 
Taking the midpoint of each ability Interval c as the item 
difficulty value b^, ^c^^k^ ^® computed as follows: 

(1) lc(Ok) = [exp(e)^ - be) (1 + exp(-e,^ + ^c)'i^'^• 
The five steps of the Basic Method are outlined below. 
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St,e p 1 , Determine the characteristics of the desired item 
bank. 

Make an overview of the desired types and numbers of tests to 
be constructed from the item bank. Groups (g = 1, 2, G) 
of tests that are not allowed to have any overlapping items, 
are identified. Tests may be part of several groups. If tests 
are allowed to have some items in common, the practitioner 
should decide whether the tests have to be treated as tests 
that may have all or no items in common. 

For each test the number of items, x^, that should be 
available in each ability interval c of the item bank, is 
determined, so that a test fitting the specifications best 
can be selected from the bank. Below it is described how this 
is done. Next, for each Group g the number of items Sg^ 
needed in interval c for this group, are computed by summing 
the Xc's of all tests in this group. 

The tests are "constructed" according to the test 
specifications as follows. It is assumed that the item bank 
contains an infinite number of items in each ability interval 
c. It is free to choose any linear programming model for the 
"construction" of a test, as long as the following decision 
variables are used: x^, indicating the number of items from 
interval c to be included in the test. Two examples of test 
construction models are the model of minimum test length 
(Theunissen, 1985) and the Maximin Model (van der Linden and 
Boekkooi-Titiminga, 1989) . As the ability continuum is assumed 
to be partitioned into C intervals, actually, the cluster 
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based method proposed by Boekkooi-Timminga (1990b) is applied 
here. The cluster-based method assumes that the items in the 
bank have been grouped according to their item information 
functions, such that items within a cluster can be considered 
equivalent. In case of the Maximin Model the test 
"construction" problem is formulated as follows: 

(2) maximize y, 
subject to 

C 

(3) 1 Ic(t'k)^c - rj^y i 0, k = 1, 2, K, 
c=l 

C 

(4) I Xc = N, 
c=l 

(5) Xc ^ 0, c = 1, 2, . . . ; C, 

(6) y ^ 0, 

The decision variable x^ gives the number of items needed in 
ability interval c of the item bank to be able to construct 
the test. The rj^'s in (3) give the relative heights of the 
test information function at the ability levels k (= 1, 2, 
K) , specified by the practitioner. By maxiriizing 
decision variable y in (2), the lower bounds r]^y to the test 
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information at the K ability levels considered are maximized. 
Constraint (4) fixes the test length at N. The bounds on the 
decision variables are set in (5) and (6) . 

In this model the maximum number of items to be selected 
from each ability interval c is unbounded (x^ ^ 0 in 
Expression [5]), thus the item bank is infinite. If it occurs 
that there are intervals for which no items can be written^ 
the corresponding x^'s should be set to zero in (5) • Note 
that the decision variables x^ are not restricted to take 
integer values • Thus, standard linear programming problems 
that can be solved quickly are the result- 

Model (2) - (6) is a basic model that may be extended by 
constraints that can be formulated as linear expressions of 
the decision variables x^, where x^ is defined as above. For 
instance, exact 10 items have to be included in the test from 
the ability intervals 1 to 5, 

Step 2 . Deter;nine the characteristics of the present item 
bank. 

Compute the number of items, z^, available in each ability 
interval c of the present item bank. 

Step 3 . Compute the differences between the desired and 
present item bank. 

In order to be able to select the tests in Group g from the 
item bank, it is necessary that the number of items Zp 
available in each ability interval of the present item bank 
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is larger than the required numbers Sg^, Now compute for each 
Group g the difference, Vg^, between Zq and Sg^ at each 
ability interval through: 

(7) VgQ ^ ^gc^ ^ ^ ^' ^' C# 

g « 1, 2, . • . I G. 

If Vgc ^ 0 for all groups and all ability intervals, the 
present item bank is perfectly suited for the test 
construction needs stated, and the procedure is STOPPER , A 
negative Vg^ value gives the numbers of items to be added to 
ability interval c of the item bank, such that it will become 
possible to select the tests in Group g best. 

Step 4 , Determine the numbers of itemSf to be added to 

each ability interval of the item bank. 

In this step, first, the lowest values, m^, of the Vg^'s over 
the Groups g are obtained as follows: 

(8) m^ = minimum {Vg^}, for c=l, 2, C. 

g = 1, . , • , G 

If m^ is negative, tm^| items should be added to this ability 
interval of the item bank. If it is positive, m^ gives the 
number of items left in this interval. 

The desire to select the most perfect tests according to 
the test specifications, might not be too strong in practice- 
If this is the case, it is possible to compensate for the 
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shortage of items in an interval by items from adjacent 
intervals. This option is of interest, considering the fact 
that it is difficult for item writers, to construct exactly 
the items required. Another consideration is that, if narrow 
intervals are used and no compensation is allowed, it will 
turn out tnat, compared to a situation with wider intervals, 
a larger /mmber of items will have to be added to the bank. 
If no comp'-v.saticn should be considered n^ *= |min{0,mQ)| for 
all c, and Goto Step 5, 

Items in intervals with > 0 can be used to compensate 
a shortage of items in an adjacent interval, if the amount of 
information in these intervals, at the midpoint of the 
interval with the shortage, is at least equal to the amount 
of information the n^ items that had to be added would 
provide. It is obvious that the number of items needed for 
compensation from adjacent intervals, is larger than n^. 
Also, the wider the intervals the more items are needed for 
compensation, and^ the less perfect the tests will be that 
aro constructed from the item bank at a later stage. This is 
caused by the fac". that the midpoints of the intervals, are 
used as tl;e item difficulty values for computing the 
information values of the items within a cluster. Note that 
the m^^ items left in an interval can only be used once for 
compensa*:ion purposes. 
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Step 5 . Write items to be added to the item bank. 
IteiTi writers should try to construct the n^ items as 
determined by Step 4 for each ability interval c. Next, the 
items are calibrated. Because in practice, it will never be 
possible to construct exactly the items that were desired, 
GotiQ Step 2 and compare the new item bank with the desired 
one • 

The suitability of a bank with percentage measurement items, 
developed by the National Institute for Educational 
Measurement in the Netherlands (Cito) , was checked. The item 
bank consisted of 470 items, after a Rasch calibration 416 
items remained. Ability intervals were formed by taking 
widths of 0.6 on the ability continuum from -4.5 to 4.5 
logits; 17 intervals were the result. 

The Basic Method was applied. Three groups of non- 
overlapping tests were taken into account. Group 1 consisted 
of four tests, all having the same test specifications: 
Minimal test length, and target test information values at 6 
= -1, 0, 1 , 2 of 4, 8, 8, 4, respectively. So the model 
proposed by Theunissen (1985) was used. Three selective tests 
were included in Group 2 . each test consisted of 30 items, 
and test information was maximized for Test 1 at 6 = -1, for 
Test 2 at e = 0, and for Test 3 at 6 = 1 • The five tests in 
Group 3 consisted also of 30 items, they were constructed 
using the Maximin Model, considering the ability points 9 = - 
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2, -1, 0, 1/2 for which the relative target information 
values were equal* 

The mathematical programming package PcProg 
(Quantitative Management Software, 1986) wfiS used, on a MsDos 
personal computer under 8 MHz with mathematical co-processor 
and hard disk, to "construct" the tests in Step 1, The 
optimization times needed for "constructing" the tests ranged 
from 0.2 to 0.6 seconds, the computation times for the matrix 
generation ranged from 4.70 to 9.30 seconds. In Figure 1 the 
results of Step 1, 2, 3, and 4 are shown. The first three 
lines indicate the numbers of items needed in each of the 
ability intervals to be able to construct the tests in each 
group as optimal as possible (Step 1) . The fourth line gives 
the numbers of items available in each interval oi the 
present item bank (Step 2) . Lines 5-7 show the differences 
between Line 4 and each of the lines 1-3 (Step 3) . The 
total difference between the present item bank and the 
derired one is shown in Line 8 (Step 4) . The negative values 
indicate the numbers of items to be added to the item bank to 
become suited. 



Insert Figure 1 about here 



From Figure 1 it i. obvious that the shortage of items 
could not be completely compensated from the adjacent ability 
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intervals, as the number of -42 could not be compensated by 
the number of Items beforehand In Its adjacent Intervals (= 
18) . After compensation It turned out that still 26 Items had 
to be constructed for this n^.erval. The shortage of Items In 
the other two Intervals could be completely compensated. The 
52 items could be compensated by considering all 34 items 
from the lower adjacent Interval and 23 items (out of 24) 
from the upper Interval. The 20 items could be compensated by 
the remaining item from its lower Interval, and 22 (out of 
23) items from the upper Interval. 



Additional Practical Constraints 



Suppose that beside psychometric constraints, that can be 
formulated using the decision variables in (2) - (6), 

other practical constraints (see van der Linden & Boekkool- 
Timminga, 1989) need to be considered in the test 
specifications of Step 1. Then, the Basic Method can be 
generalized in two manners depending on the nature of the 
item characteristics considered in these constraints. The 
generalizations called Procedures 1 and 2 are described next. 



Procedure 1 

Observing the items on a certain item characteristic in the 
present item bank, it is expected that items can be written 
for each individual ability interval, if the present items 
one contained in almost all abil ity intervals . Practical 
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constraints on such item characteristics can be treated by 
adapting the Basic Method as follows. 

In step 1 . first, formulate the test specifications as 
is done in the Basic Method. Additional specifications regard 
the numbers of items reflecting certain item characteristics 
that have to be included in each test. It is possible that 
the items in the bank reflect more than one of these 
characteristics. If this is the case, each combination of 
characteristics that may occur, is treated like a new 
characteristic. Thus, the resulting set of item 
characteristics (j =1, 2, J) is mutually exclusive. 

Next, the numbers of items, s^j, reflecting item 
characteristic j that are needed for t^sts t are given by the 
practitioner. Then, the x^'s are obtained for each test, as 
described for the Basic Method. Note that the constraints on 
the item characteristics are not regarded yet. The number of 
items, s^c' needed in interval c for test t is set equal to 
x^. Let be the collection of ability intervals with s^^ ^ 
0 for test t. Determine in step 2 the z^'s as described for 
the Basic Method, and the numbers of items, z^j^ on each item 
characteristic available in ability interval c of the item 
bank. Next, compute w^j for each test t, indicating the 
numbers of items reflecting item characteristic j available 
in the ability intervals in of the item bank, through: 

(9) w^i = S Zc-j, j = 1. 2, . . J, 

C€Ht 

t ~ If 2, T» 
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In Step 3 the differences v^^ between and s^^ are 
obtained by taking v^^ = - s^q at all ability intervals 
for each test t. Also, the differences between s^j and w^j 
are determined for each item characteristic j and each test 
t. These differences, l^j = I min{ 0, w^ j-s^ j } I give the 
shortage of items on characteristic j for test t. 

Step 4 > If v^^, < 0 then Iv^^l gives the number of items 
to be added to interval c of the item bank. Again, 
compensation can be used, if this is done and l^j should 
be adapted. Given the Itj'^ Intel's (v^^ < 0), the 

practitioner has to decide on the number of items, t^^j, on 
characteristic j that have to be added to interval c for test 
t. It is obvious that these numbers can be chosen in several 
ways, there is no unique solution. While choosing these 
numbers the practitioner should keep an eye on the spread of 
items, reflecting the characteristic of interest, in the 
present item bank over the ability continuum. He/she should 
fix the numbers t^^j such that it is expected that these 
items can easily be constructed, because several items with 
approximately the same chsHraoteristics are already included 
in the item bank. 

Two possibilities are distinguished for each test t: 
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J 

(a) I Ivtcl < ^ Itj' a^cl 

J 

(b) I Ivtcl ^ Itj' 
ceKt j=l 

where is the collection of ability intervals for which v^^ 
< 0. In case of (b) the t^^j'^ chosen such that the items 
l^j are spread over the ability intervals such that at most 
Iv^cl items have to be constructed for interval c. For case 
(a), part of the l^j items have to be constructed such that 
the required numbers Iv^cl obtained. The remaining items 

to be added to the bank should be constructed such that the 
numbers of items, reflecting the item characteristics of 
interest, added to the bank in each of the ability intervals 
in is not larger than s^^ determined in Step 1. 

At this point of Procedure 1, the groups 
g(= 1, 2, G) of non-overlapping tests identified in Step 

1 are regarded. Given the t^cj's ^or all tests within Group g 
(Vg) , the numbers of items, Sg^j. reflecting characteristic j 
to be added to ability interval c of the item bank, can be 
computed as follows; 

g= 1, 2 , G, 
c^ 1, 2 f • • • f 0, 
j = l, 2 , • * • , J. 



(10) Sgcj = I ttcj/ 

teVg 
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After the Sg^j's have been obtained for each Group q, 
the n^j's, giving the number of Items on characteristic j to 

be added to ability Interval c of the bank, are obtained 
through: 

(11) n^j = maximum ^^gcj^' c « 1, 2, C, 
g = 1 , • • • # G 

j ~ 1# 2# J« 

In Step 5 Item writers should try to construct these n^j 
Items . 

Proredure 2 

Procedure 2 should be used If Item characteristics have to be 
considered In the practical constraints that represent Items 
that can not be constructed for a number of ability 
Intervals. This Is the case, for Instance, If Items regarding 
a certain learning objective all have low difficulty values. 
Procedure 2 should also be applied If typical test 
specifications have to be taken Into account, for Instance, 
sped flcat Ions regarding relational aspects between Item 
characteristics. It Is emphasized that Procedure 1 should be 
applied as much as possible, because the Item bank design 
process will be simpler. Only Item characteristics that 
d^=*finitely will cause problems applying Procedure 1 should be 
treated by Procedure 2. 

The Basic Method Is adapted as follows. As for Procedure 
1, the item bank designer decides In gt;ep ; which item 
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characteristics (j = 1, 2, J) have to be taken into 

account. Next, the decision variables are replaced by Xj,j. 
The Xcj's denote the numbers of items selected for the test 
from ability interval c that reflect item characteristic j. 
Model (2) - (6) is generalized as follows: 

(12) maximize y, 
subject to 

C J 

(13) I I Ic<®)c)**ci " k " 1, 2, K, 
c-1 j=l 

C J 

(14) I I Xp-) = N, 
c=l j=l 

C 

(15) I Xcj = Nj, j = 1, 2, . . J, 

c=l 

(16) 0 ^ Xcj i Ucj» c = 1, 2, ...» C, 

j ~ 1, 2 1 J f 

(17) y ^ 0. 

As an example, the cons':raints in (15) state that exact Nj 
items reflecting characteristic j have to be included in the 
test . Note that, if the above model is applied, it is 



Designing IRT-based Item Banks 

21 



possible that several solutions with the same objective 
function value can be obtained, especially if u^j is large, 
so there is no unique solution* Application of the above 
model is only meaningful if several small u^j values are to 
be considered, or if many practical constraints like those in 
(15) are included in the model. Otherwise Procedure 1 should 
be applied. 

Now for each Group g of non-overlapping tests, the x^j's 
are summed for each item characteristic j to get the 
corresponding Sg^j, defining for Group g the number of items 
on characteristic j needed in ability interval c. In Step 2, 
the z^j's, indicating the numbers of items on characteristic 
j included in ability interval c, are determined for the 
present item bank. 

In Step 3 the differences between the Sg^j's and Zcj's 
are computed for all ability intervals and all item 
characteristics of interest. This is done for each Group g of 
non-overlapping tests. 

(18} Vgcj = - Sgcj' 9*1/ 2, G, 

C 1, 2, .««, 
j=l, 2 , J. 

If all Vgcj's are all positive, the present item bank is 
perfect . 

In Step 4 of the Basic Method m^ is replaced by m^j. The 
m^j's are determined as follows: 
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(19) m^j = minimum {^g^j), for c = 1, 2, C, 

g = 1 1 • • * 9 G 

]j ^ \ f 2 f «*tf vTt 

Finally, n^j = |min{0,mcj) I # where n^j is the number of items 
reflecting item characteristic j that has to be added to 
ability interval c. Again compensation from adjacent ability 
intervals can be applied. Then, each item characteristic is 
considered separately. 

In step 5 item writers should try to construct the n^j 
items indicated by Step 4. 

Discussion 

In this paper a method for item bank design has been 
proposed. Its purpose is to check whether an existing IRT- 
based item bank can fulfil a series of test construction 
desires. If not, it indicates which items should be added to 
the item bank, such that it becomes satisfactory. The method 
is interest when item banks are developed that have to be 
satisfactory for a certain period without being extended. 

Here it was assumed that the Rasch model holds, however, 
the method can also be applied to the three-parameter 
logistic model. In this case the items in the bank have to be 
divide::: over mutually exclusive groups (clusters) , such that 
the items in the same cluster have fairly equal information 
functions. Next, the ability intervals c considered in the 
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method are replaced by these clusters. Considering the two- 
or three-parameter logistic model instead of the Rasch model 
will be more complicated. First, there will be much more 
clusters for these models, than ability intervals for the 
Rasch model . Second, the compensation process will be more 
complicated, and it will be even more difficult for item 
writers to construct items belonging to an explicit cluster. 

In this paper linear programming models instead of 
integer linear programming models were formulated, thus, it 
is expected that some decision variable values will have 
fractional parts. It might appear contradictory that the 
decision variables Xq and x^j reflect numbers of items 
(integers) , however, a linear programming approach is 
justifiable for this application. The reason for preferring 
the linear approach is obvious because linear programming 
problems can be solved very quickly compared to integer 
linear programming problems. This is very important because 
several linear programming problems have to be solved when 
the method is applied. Research on test construction by 
linear programming showed that the tests obtained by rounding 
fractional decision variable values are fairly optimal in 
many cases (e.g. Boekkooi-Timminga . 1989, 1990b) . This was 
especially the case for the Rasch model. So only small errors 
are made when linear instead of integer linea; programming is 
used. The errors that are made by rounding the fractional 
values, cause that a little more items have to be 
constructed, because all fractional values are rounded 
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upwards. However, as it will be difficult for test 
constructors to give the exact test specifications for tests 
in advance (Step 1), it is acceptable thcit the 
appropriateness of the item bank is tested relatively rough. 
Thus, the errors made are quite acceptable. 

It will be difficult in practice to construct the 
desired items in Step 5. If it turns out that unrealistic 
large numbers of items have to be added to the bank there are 
three options . One is to relax the non-overlapping 
requirements of ti^sts in Step 1, such that less tests are 
required to have no overlap. Another is to put upper bounds 
on the numbers of items that may be selected from certain 
ability intervals in Step 1. A consequence of this approach 
\s that, when the actual tests are selected from the item 
bank at a later stage, the same upper bounds are required, 
otherwise the problem of item bank exhaustion might turn up 
again. The third option is to consider not only items from 
adjacent ability intervals for compensation in Step 4, but 
also those from other intervals. However, care should be 
taken doing this, because the tests that will be constructed 
from this iten bank will become less perfect, and the 
possibility of an exhausted item bank becomes actual again. 

It is also possible that large numbers of items have to 
be added to the item bank, because the test specifications 
are unrealistic for the item bank . For instance, if only 
difficult items can be constructed for the item bank, while 
only easy items are required for the tests, in this case the 
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desired items can not be written, and, if co^^^-^nsation is 
allowed, large numbers of difficult items will have to be 
added to get the amount of information required at the low 
ability levels. However, it is obvious that this approach is 
not to be recommended. 

The compensation process mentioned in this paper is part 
of Step 4. However, it is also possible to include it into 
Step 3. This might be '^f interest, if it is required to have 
different ranges of compensation allowed for the respective 
groups of non-overlapping tests. For instance, for Group 1 no 
compensation is allowed, while for Group 2 compensation is 
allowed from both adjacent ability intervals. 
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Figure Captions 

Figure l ! Testing the Suitability of a Percentage Measurement 
Item Bank* 
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